Speech Interface Exploiting Intentionally-Controlled Nonverbal Speech Information

نویسندگان

  • Masataka Goto
  • Katunobu Itou
  • Tetsunori Kobayashi
چکیده

This paper describes our research on speech interfaces using nonverbal speech information. Although speech information consists of verbal and nonverbal information, most speechrecognition research has made use of only verbal information such as words and sentences. From among nonverbal information, we have focused on hesitation (filled pause) and prosody (voice pitch) to create four speech-interface functions: Speech Completion, Speech Shift, Speech Starter, and Speech Spotter. Hesitation, for example, can be used as a trigger to complete an uttered fragment and pitch changing can be used to enter a word with it having different functions. By having users intentionally utter nonverbal information according to simple rules, we have achieved interfaces that can exploit the potential of speech in various forms. ACM Classification: H5.2 [Information interfaces and presentation]: User Interfaces. Voice I/O. General Terms: Design, Human Factors

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Interface Exploiting Nonverbal Information

This paper introduces our research on speech interfaces using nonverbal information and examines new possibilities in speech interfaces. Although speech information consists of verbal and nonverbal information, most speech-recognition research has made use of only verbal information. From among nonverbal information, we have focused on hesitation (filled pause) and prosody (voice pitch) to crea...

متن کامل

Speech shift: direct speech-input-mode switching through intentional control of voice pitch

This paper describes a speech-input interface function, called speech shift, that enables a user to specify a speech-input mode by simply changing (shifting) voice pitch. While current speech-input interfaces have used only verbal information, we aimed at building a more user-friendly speech interface by making use of nonverbal information, the voice pitch. By intentionally controlling the pitc...

متن کامل

Speech Completion: New Speech Interface with On-demand Completion Assistance

This paper describes a novel speech interface function, called speech completion, that helps a user enter a word or phrase by completing (filling in the rest of) a phrase fragment uttered by the user. Although the concept of completion has been widely used in text-based interfaces, effective completion for speech has not been proposed. We enable a user to invoke the speech-completion function i...

متن کامل

An Acoustic Study of Emotivity-Prosody Interface in Persian Speech Using the Tilt Model

This paper aims to explore some acoustic properties (i.e. duration and pitch amplitude of speech) associated with three different emotions: anger, sadness and joy against neutrality as a reference point, all being intentionally expressed by six Persian speakers. The primary purpose of this study is to find out if there is any correspondence between the given emotions and prosody patterning in P...

متن کامل

An intentional stance modulates the integration of gesture and speech during comprehension.

The present study investigates whether knowledge about the intentional relationship between gesture and speech influences controlled processes when integrating the two modalities at comprehension. Thirty-five adults watched short videos of gesture and speech that conveyed semantically congruous and incongruous information. In half of the videos, participants were told that the two modalities we...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005